Learning Policies for First Person Shooter Games Using Inverse Reinforcement Learning
نویسندگان
چکیده
The creation of effective autonomous agents (bots) for combat scenarios has long been a goal of the gaming industry. However, a secondary consideration is whether the autonomous bots behave like human players; this is especially important for simulation/training applications which aim to instruct participants in real-world tasks. Bots often compensate for a lack of combat acumen with advantages such as accurate targeting, predefined navigational networks, and perfect world knowledge, which makes them challenging but often predictable opponents. In this paper, we examine the problem of teaching a bot to play like a human in first-person shooter game combat scenarios. Our bot learns attack, exploration and targeting policies from data collected from expert human player demonstrations in Unreal Tournament. We hypothesize that one key difference between human players and autonomous bots lies in the relative valuation of game states. To capture the internal model used by expert human players to evaluate the benefits of different actions, we use inverse reinforcement learning to learn rewards for different game states. We report the results of a human subjects’ study evaluating the performance of bot policies learned from human demonstration against a set of standard bot policies. Our study reveals that human players found our bots to be significantly more human-like than the standard bots during play. Our technique represents a promising stepping-stone toward addressing challenges such as the Bot Turing Test (the CIG Bot 2K Competition).
منابع مشابه
Learning to be a Bot: Reinforcement Learning in Shooter Games
This paper demonstrates the applicability of reinforcement learning for first person shooter bot artificial intelligence. Reinforcement learning is a machine learning technique where an agent learns a problem through interaction with the environment. The Sarsa( ) algorithm will be applied to a first person shooter bot controller to learn the tasks of (1) navigation and item collection, and (2) ...
متن کاملRETALIATE: Learning Winning Policies in First-Person Shooter Games
In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team firstperson shooter games. RETALIATE has three crucial characteristics: (1) individual BOT behavior is fixed although not known in advance, therefore individual BOTS work as “plugins”, (2) RETALIATE models the problem of learning team tactics through a simple state formulation,...
متن کاملImplementation and Initial Experience Using a Web-Based, Rapid-Fire Teaching System with Game-Like Elements for Chest
Reinforcement learning theory posits that a learner's decision gains value proportional to the discrepancy between the predicted and the actual outcome in reward or punishment (1,2). Rapid-reinforcement feedback mechanisms also contribute to an experiential optimum known as “flow,” an “overwhelming proportion of [which occur] within sequences of activities that are goal-directed and bounded by ...
متن کاملApprentissage par renforcement factorisé pour le comportement de personnages non joueurs
In this paper, we apply a general reinforcement learning method to automatically design the behavior of non player characters of the Counter-Strike first person shooter computer game. The result of the learning process is a set of decision trees that represents compactly and easily readable a model of the problem itself and the decision policy of characters. Beyond this example, we discuss the ...
متن کاملTraining Agent for First-person Shooter Game with Actor-critic Curriculum Learning
In this paper, we propose a new framework for training vision-based agent for First-Person Shooter (FPS) Game, in particular Doom. Our framework combines the state-of-the-art reinforcement learning approach (Asynchronous Advantage Actor-Critic (A3C) model [Mnih et al. (2016)]) with curriculum learning. Our model is simple in design and only uses game states from the AI side, rather than using o...
متن کامل